RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTE lOLOGY 
1 SYSiEMS 
BRANCH 




It'' 



DEE 093305 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09/ ^72^ 6 ^ ^ /? 
Source: 

Date Processed by STIC: //- 2V' OO ' ' 

t 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPjENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2,1 e-mail help: patin21helD@usDto.gOV or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin30help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 




Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program ^tt f i 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings/jj^, - — 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old ^•"Sjfjlji^ 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual ^•^W 
Property Organization (WIPp) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer enrored sequence 
listings, thus saving time and money. 



Checker Version 3,0 can be down loaded from the USPTO website at the following address 

http://www.uspto.gov/web/offices/pac/checker 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/ 09/4 72 , 6 8 8A 



DATE: 11/24/2000 
TIME: 15:33:26 



Input: Set : 
Output: Set: 



A <110> 

5 

7 <120> 
10 <130> 

12 <I4 0> 

13 <141> 

15 <150> 

16 <151> 

18 <150> 

19 <151> 

21 <150> 

22 <151> 

24 <150> 

25 <151> 
28 <160> 
30 <170> 



A:\ES.txt 

N:\CRF3\112 4 2000\I4 7 26 8 8A.raw 



Richard A. 



RECEIVED 



APPLICANT: Shimkets 
LtDacli, Martin D. 

TITLE OP INVENTION: cSlngle Nucleotide Polymorphisms for Knov;n Genes 

FILE REFERENCE: 15966 - 53 4 - CIPl 

CURRENT APPLICATION NUMBER: 09/472, 688A 

CURRENT FILING DATE: 1999-12-27 

PRIOR APPLICATION NUMBER: 60/109,024 

1998- 11-17 
NUMBER: 0 9/4 4 3,19 9 

1999- 11-15 
NUMBER: 09/442,129 

1999-11-16 
NUMBER: 09/4 4 2,84 9 
1999-11-17 
NUMBER OF SRQ ID WOS : 880 

SOFTWARE: CuraGen Patent Formatter Version 0.9 



PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING Dz\TE 



Does Not Con^PW . 
corrected DisKette Need J 



ERRORED SEQUENCES 

114 40 <210> SEQ ID NO: 568 ^ 

114 41 <211> LENGTH :/r4^ / / U i'>i 

1144 2 <212> TYPE; PIf 



11443 <213> ORGANISM: Homo sapiens ^ J 1/ / ' p V^^^ 

11445 <220> FEATURE: — ^ / 7 / ^ ^ 



114 45 <220> FEATURE 
11446 <221> NAME/KEY: VARIANT ^ . 

<222> LOCATION: (7)...(0) ^ ^ /? ^ 

<22 3> OTHER INFORI'^ATXON : cSNP translation 
SEQUENCE: 6 68 



SEQ ID NOi 7 69 




TYPE: PRT *^ 
ORGANISM: Honio sapiens 

12861 <220> FEATURE: 

12862 <221> NAME/KEY: VARIANT 
12 863 <222> LOCATION: (7)...(0) 
12864 <223> OTHER INFORMATION: cSNP translation . 

12866 <400> SEQUENCE: 769 ^ <Z A 

12867 Gin Ser Val Val Ser CyS/^"^. / J> ^T? O 

12868 1 ^ ( '1 J 
12954 <210> SEQ ID NO-tn 7 76 

J 2 955 <211> length/ 14 j 

12956 <212> TYPE: — — 

12957 <213> ORGANISM: Homo sapiens 
12959 <220> FEATURE: 



/y //s^y 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/ 09/47 2 , 68 8A 



DATE: .11/2 4/2000 
TIMK: 15:33:28 



12960 
129 61 
12962 
12954 
J 2965 
12966 
13990 
13S91 
13992 
13993 
13995 
1399G 
13997 
13998 
14000 
14001 
14002 
14004 
14 005 
14 006 
14007 
14 009 
14 010 
14011 
14012 
14 014 
14 015 
14016 
14 0 32 
14033 
14 0 34 
14 03 5 
■ 140 3 7 
14038 
14039 
14040 
14 04 2 
1404 3 
14044 
141 16 
14117 
14118 
14119 
14121 
14122 
14:12 3 
14124 
1 4126 
14127 



50 



[-Jomo sapiens 

VARIANT 
(7) ... (0) 



Input S(?t : A:\ES.txt 

Output Set: N:\CRF3\11242000\I472688A.raw 

<221> NAME/KEY: VARIANT 
<222.> LOCATION: (7)...(0) 

<223> OTHi;R TNFORHATTOM: cSNP translation 
<4 00> SEQUmCE: 77 G 
Glu 
1 

<210> 11) 
<211> LENGTH 14 
<212> TYPE: F 
<213> ORGANISM 
<220> FEATURE: 
<221> NAMfcl/KEY 
<222> LOCATION 
<223> OTHER INFORMATION 
<4 00> SEQUENCE: 850 

Phe Phe Lys Arg Asn Arg His Thr Pro Gly Arq Arg 
1 5 10/2. 

<210> SEQ ID ^^8 51 
<211> LENGTH 
<212> TYPE 

<213> ORGANISM: Homo sapiens 
<220> FEATURE: 
<221> NAME/KEY: VARIANT 
<222> LOCATION: (7)...(0) 

<223> OTHER INFORMATION: cSNP translation 
<400> SEQUENCE: 851 
Thr He Gin Pro Pro Arg /JlrTN 

(jtJ 

<210> SEQ ID N0T-Nfi5 3 
<211> LENGTH (14 ) 



cSNP translation 



;th[ 9 J 

i: iW-^ 



Homo sapiens 



<212> TYPE: 
<213> ORGANISM; 
<220> FEATURE: 
<221> NAME/KEY: VARIANT 
<222> LOCATION: (8)...(0) 
<223> OTHER INFORMATION: cSNP 
<4 00> SEQUENCE: 85 3 
Gly Gly Lys Met Gly Gly Arg Lys Arci Leu Gin. 

15 10 
<210> SEO ID Na;^8 59 



translation 



<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<223> 



ys 



Homo sapiens 



LENGTH : 
TYPE: 
ORGANISM: 
FEATURE : 

NAME/KEY: VARIANT 

LOCATION: (7) ... (0) 

OTHER INFORMATION: cSNP translation 
<4 00> SEQUENCE: 8 59 
Thr: Asp Phe Phe Phe 



/V //s/e.c/ 



RECEIVED 

DEC 1 12000 

•IKHCB(TER160a/29OO 
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RAW SEQUENCE LISTING 

PATENT APPLXCATTON: US/09/4 72 , 688A 



DATE: 11/24/2000 
TlMli:: 15:33:28 



input S(?.t : A:\ES.txt 

OlitpuL Set: N:\CRf3\1124 2 0 00\l4 7 26 8BA.raw 



E--> 14128 

14130 
14131 
14 132 
14133 
14 135 
14 13G 
1413 7 
14 138 
14140 
14141 

E--> 14142 
14 214 
14 215 
14216 
14217 
14 219 
14220 
14221 
14222 
14 224 
14 225 

E--> 14226 
14 228 
14 229 
14 230 
14 231 
1423 3 
14 2 34 
14235 
14 236 
14238 
14 23 9 

E--> 14240 
14 2 56 
14257 
14258 
14259 
14 261 
14262 
14263 
14 2G4 
14266 
14 26 7 

E--> 14268 
14270 
14271 
14272 
14273 



1 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<223> 
<400> 



SEQ ID ;#OT^60 
LENGTH :i 
TYPE: P> 
ORGANISM: Homo sapiens 
FEATURE: 

N A Miy K E if : V AR 1 A N T 
LOCATION: (8) ... (0) 
OriiEK JNFORMATTON: cSNP 
SEQUENCE: 860 



13 



trans la t ion 



Gl 
1 

<210> 
<2 :[ i> 

<212> 
<213> 
<220> 
<221> 
<222> 
<223> 
<4 00> 
Asn 
1 

<210> 
<211 > 
<212> 
<213> 
<220> 
<221> 
<222> 
<223> 
<400> 



Ala Gly Ser Val Ser Asp His His Ser lie Tlir 
5 10 
SEO ID U6-rN3G6 
LENGTH :[ 14 J 
TYPE: Pft^^ 
ORGANISM: Homo sapiens 
FEATURE; 

NAHE/K EY: V AR I A NT 
LOCATION: ( 7) . . . ( 0) 

OTilER INFORMATION: cSNP translation 
SEQUENCE: 866 
Phe Tie Leu Ala Cys Pro jfTrg 



/¥ 
8 



3'rH (14) 




SEQ ID ^iia-<^867 
LENGTH 
TYPE 

ORGANISM: Jlomo sapiens 

FEATURE: 

NAi-lE/KEV: VARIANT 
LOCAIION: (7) ... (0) 

OTHER INFORMATION: cSNP translation 
SEQUENCE: 867 



/3 



Arg Glu Lys Leu Arg Asn Phe 

1 5 
<210> SEQ ID NO: 86 9 
<211> LENGTH j^Tl"^ 
<212> TYPE: FSi^rrT^ 
<213> ORGANISM; Homo sapiens 
<220> FEATURE: 
<221> NA14E/KEY: VARIANT 
<222> LOCATION: {7)...(0) 
<22 3> OTHER INFORMATION: cSNP 
<400> SEQUENCE; 869 
Lys Arg val Ala Gly Gly Leu 

1 5 
<210> SEO ID Na-?~^70 
<211> LENGTH jT 14 ) 
<212> TYPE: Nii"^^ 
<213> ORGANISM: Homo sapi.ens 



His Phe ser Met Ser ^^^^^ 
10 



translation 



3 A^<:^^ 
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RAW SEQUENCE LISTING DATE: 11/24/2000 

PATIENT APPLlCAriON: US/09/4 7 2 , 6 8 8A TIME: 15:33:28 

Input: Set : A:\ES.txt 

OutfJuL Set: N:\CRF3\ii242000\I472688A.raw 

14 275 <220> FfJATURE: 

14276 <22.T> NAME/KEY: VARIANT 

14277 <222> LOCATION: (8),..{0) 

14 27 8 <22'3> 01 HER I. NFORl-lAT ION : cSNP translation 

1428U <4 00> SEQlTEiNCE: 870 

14 28.1 Gly Lys Arg Val Ala Gly Gly Leu ^rg 

14282 1 5 

14298 <210> SEQ ID J<rjT^72 

14 299 <21I> length/ 14 J 

14 300 <212> TYPE: k^?^ 

14 301 <213> ORGANISM: Homo sapiens 

14 303 <220> FEATURE: 

14 304 <221> NAME/KEY; VARIANT 

14305 <222> LOCATTOH: (B)...(0} 

14 306 <22 3> OTHER INFORMATION: cSNP 

14308 <400> SEQUENCE: 872 

14 309 Gin Arq Ser lie Ser Ala A^ 

14310 1 ^ M / 

14410 <210> SEQ ID NO: BBO \^ 

14411 <211> LENGTH: 14 

14412 <212> TYPE: PRT 

14413 <2.13> OKGANTSM: Homo sapiens 
14415 <220> FEATURE: 
14 416 <221> N.\£4E/KEY: VARIANT 
14417 <222> LOCATION: (7) ... (0) 

"14418 <223> OTHER INFORMATION: cSNP translation 

14 4 20 <4 00> SEQUENCE: 880 

144 21 Cys Lys Cys Ser Arg Lys Asp Pro Arg Ser Ala Thr Ala Tlir 

__1 ' 't-^ 1^ 5 10 

r^-> 14424 15 

-> 14433 2 



I Axg 

0) 

translation ^ ^ I? O CmJ ^ 




Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/4 7 2 , 6 8 8A 



OATC: 11/24/2000 
TIME: 15:33:29 



Input: Sot I A:\ES.txt 

Output Set: N:\CRF3\il242000\I472688A.raw 



L 


4 7 


^1 : 


1 1 o 




(48) 


String data converted 


to 


lower case, 


L 


u 4 


A '. 


1 1 'J 

L J. i 




( 40 ) 


Strinc] data converted 


to 


lower case, 


L 


8 1 


*i : 


i. 1 2 


: ■ 

^ : 


(48) 


Strinq data converted to 


Lower case, 




9 8 


4 '. 


L.L -£ 




( 45 ) 


St:rincj data converted to 


lower case, 


r' 


J. ± J 


M 


* 11-* 




f 4 8 


StriiKj 




^ ^ rv T/ r* t" /-I rT 
' 1 J I i V t: I, U t ' U 


to 




case, 


L 


132 


H 


■ 1 1 O 


c 




S u r Log 


d n t a 




to 




case. 


L 


1 ^1 G 

i 4 y 


M 


• 1 1 O 
, 1 I. Z 


c 


{AG 


■ S tr Iricj 




con V er ted 


to 


.^0 rfe r 


case, 


L 


i t> u 


M 


: 112 


^ 


(48' 


S ti" 1119 


d-'t-^ 


cori V er tfc? ti 


1 0 


I*^i^e^ 


case. 


L 


i o J 


M 


. .L.Li 


c 


('to 


S t r intj 


d-^ti 


converted 


to 


1 "wer 




L 


200 


11 


: 11 2 


c 


(40 


String 






1 0 


1 w ■ 


case. 


L 


2 7 


M 


: 11 2 


c 


(48 


S tr incj 


data 


CO n ver ted 




ower 


case, 


L 


^ i 4 


M 


; 1 1 2 


c 


{48 


S tr 1 ng 


da ta 


con ve rteci 


to 


. ower 


case, 


1. 


2 51 


M 


; 112 


c 


( 4 b 


String 


data 


conv ert ed 


to 


ower 


case. 


L 


268 


M 


: 11 2 


c 


(48 


String 


data 


c 0 n V e r t. e o 


to 


ower 


c ti s e , 


L 


28 5 


M 


' 112 




(48 


S tr 1 ug 


da td 


converted 


LO 


ovver 


case. 


L 


3 02 


M 


.112 




(mo 


String 


d a t a 


converted 


to 


1 owe r 


case , 


L 


319 


M 


112 




(48 


String 


data 


converted 


to 


lower 


case f 


L 


3 3 6 


M 


112 


c 


(48 


String 


data 


conv erted 


to 


lower 


ca se , 


li 


3 5 3 


M 


112 


c 


(48' 


S t r m g 


data 


converted 


to 


lower 


Ccjse , 


L 


3 7 0 


M 


112 




(48' 


String 


da ta 


converted 


to 


lower 


a s e , 


L 


3 87 


M 


1 1 2 




{48 


String 


da ta 


con ve rted 


to 


lowe r 


case , 


T.- 


4 0 4 


M 


112 


c 


(48 


S t r i n g 


data 


converted 


t-O 


lower 


ca se , 


L 


4 2 1 


M 


112 


c 


(48' 


String 


data 


conve rted 


to 


lower 


CcJ se , 


Ij 


4 3 8 


M 


112 


c 


(48' 


S t ring 


data 


converted 


to 


lower 


case / 


L 


4 5 5 


M 


112 


c 


(48 


String 


fi a ta 


con vei'ted 


to 


1 ower 


case, 


T. 


4 7 2 


M 


112 


c 


(48' 


String 


d a t a 


conve rted 


to 


lower 


ca S6 , 


r. 


4 8 9 


M 


112 


c 


( 4 8 1 


String 


data 


conve rted 


to 


lov/er 


ca se , 




506 


M 


112 


c 


(48' 


S t.ring 


d ata 


converted 


to 


lower 


ca se / 


L 


52 3 


H 


112 


c 


{48; 


String 


data 


converted 


to 


lower 


case , 


L 


54 0 


M 


112 


c 


(48; 


String 


data 


converted 


to 


lower 


case , 


T. 


557 


H 


112 


c 


( 48 ; 




data 


l_ LJ 11 V I,, i, Lt . \1 


{' 0 








57 4 


M 


] 12 


c 


( 48 ; 


String 


data 


converted 


to 


lower 


case , 


L 


591 


M 


112 


c 


(48; 


String 


data 


converted 


to 


lower 


case , 


t. 


6 08 


M 


112 


c 


(48; 


String 


data 


converted 


to 


lov/er 


case , 


■[ 


6 2 5 


M 


-j 


^ 


(48' 


String 


data 


converted 


to 


lower 


case , 


L 


642 




112 


c 


{48) 


String 


data 


converted 


to 


lov/er 


case , 


L 


659 


M 


112 


c 


(48) 


String 


data 


converted 


to 


lower 


case , 


L 


676 


M 


112 


c 


(48) 


String 


data 


converted 


to 


lower 


case , 


L 


6 9 3 


[-1 


112 


c 


(48) 


String 


data 


converted 


t,o 


lov/er 


case , 


L. 


710 


M 


112 


c 


(48 ) 


String 


data 


converted 


to 


J.ov/er 


case. 


L: 


727 


M 


112 


c 


(48) 


Stri ng 


data 


converted 


to 


1 owe r 


case , 


L: 


744 


M 


112 


c 


(48) 


String 


data 


converted 


to 


1 ower 


case , 


L: 


761 


M 


112 


c 


(48) 


String 


data 


converted 


to 


lower 


case , 


L: 


778 


M 


112 


c - 


{48 ) 


String 


data 


converted 


to 


lov/er 


case , 


L: 


795 


M 


i:i2 


c- 


(48 ) 


Stri ng 


data 


converted 


to 


1 ower 


case , 


L: 


812 


M 


112 


c 


(48 ) 


Stri ng 


data 


converted 


to 


lower 


case , 


L: 


829 


H 


112 


c- 


(48) 


String 


data 


converted 


to 


lower 


case , 


L: 


8 46 


M 


112 


C: 


(48) 


String 


data 


converted 


to 


lower 


case , 



file://C:\CRF3\Outhold\Vsii472688A.htm 



1 1/24/00 



4 



Page 6 of 7 



VERIFICATION SUMMARY DATE: .11/24/2000 

PATFJMT APPLXCATTOM: US/09/4 7 2 , 6 8 8A TIME: 15:33:29 

Input Set : A:\ES.txt 

Output Set: N:\CRF3\11242000\I472588A.raw 



L:8G3 M:112 C: (48) String data converted to lower case, 
L:880 M:112 C: (48) String data converted to lower case, 
L:6357 M:341 W: (46) "n" or "Xaa" used, for StlQ IDS: 3 70 
L : 6 9 5 Q M : 3 4 1 Vv : (45) " u " or " Xaa " used, I'or SEQ T D « : 4 04 



L 


114 5 2 


M 


252 


E 


No. 


of 


Seq . 


di f fers , 


<211> LENGTH 


: Input 


14 


Found 


: 1 


SEQ: 668 


L 


12B63 


M 


252 


E 


Mo . 


of 


Seq . 


dif fers , 


<211> LENGTH 


: Input 


14 


Found 


: 7 


SEQ: 709 


L 


12966 


ti 


252 




No. 


of 


Seq . 


differs , 


<211> LENGTH 


; Input 


14 


Found 


: 1 


SEQ: 7 76 


L 


13805 


H 


341 


W 


(46) 


"n" or 


"Xaa" used, for SEQ 


IDs : 83C 










L 


14 002 


M 


252 




No. 


of 


Seq . 


differs , 


<211> LENGTH 


: Input 


14 


Found 


: 12 


SEO: 850 


L 


14 016 


M 


252 


E 


No. 


of 


Seq . 


di f f ers , 


<211>LENGTH 


: Input 


9 Found: 


7 SEQ: 8 51 


L 


14044 


t-1 


252 


n 


No . 


of 


Seq . 


differs , 


<211>LENGTH 


: Input . 


14 


Found 


; 12 


SEQ: 8 53 


L 


14128 


M 


252 


E 


No. 


of 


Seq . 


differs , 


<211> LENGTH 


: Input . 


14 


Found 


:6 


SEQ: 859 


1. 


14 14 2 


M 


252 


E 


No. 


of 


Seq . 


di ffers , 


<211> LENGTH 


: I nput : 


,14 


Found 


; 13 


SEQ: 860 


L 


14 226 


M 


252 


E 


NO . 


of 


Seq . 


differs , 


<211> LENGTH 


: Input : 


14 


Found 


: 8 


SEQ: 8 66 


L 


14 240 


H 


252 


E 


No. 


of 


Seq . 


differs , 


<211>LENGTH 


: Inpu L : 


14 


Found 


: 13 


SEQ: 8 67 


L 


14268 


M 


25 2 


E 


No . 


of 


Seq . 


dif fers , 


<211>LENGVH 


: Input : 


14 


Found 


: 8 


SEQ: 869 


L 


14282 


M 


252 


E 


No. 


of 


Seq . 


differs , 


< 2 11> LENGTH 


: Input : 


14 


Found 


:9 


SEQ: 870 


L 


14 310 


M 


252 


E 


No. 


of 


Seq. 


differs , 


<211>LENGTH 


: Input : 


14 


Found 


:7 


SEQ: 872 


L 


14424 


H 


332 


E 


(32) 


in V a 11 d /M i s s i n 9 


Amino Acid Numbering 


SEQ 


ID : 


880 



M;332 Repeated in SeqNo = 8 8() 
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